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We construct a class of network growth models based on local interactions on a metric space, capable 
of producing arbitrary degree distributions as well as a naturally high degree of clustering and assorta- 
tivity akin to certain biological networks. As a specific example, we study the case of random-walking 
agents who form bonds only when they meet at certain locations. The spatial distribution of these “ren¬ 
dezvous points” determines key characteristics of the network. For any arbitrary degree distribution, we 
are able to analytically solve for the required rendezvous point distribution. 


I. INTRODUCTION 

Many real-world networks are known to be scale-free and 
possess a very short average network distance (the so- 
called small-world property) (HITS). Traditionally, mod¬ 
els of scale-free networks such as the preferential at¬ 
tachment model of Barabasi and Albert (BA) and small- 
world networks such as Watts-Strogatz IIMl focus on re¬ 
producing topological properties of real-world networks 
without regard for the geometric character of the under¬ 
lying processes. However, in many real-world networks 
such as social networks, nodes are embedded in a met¬ 
ric space and links are generally established only if nodes 
make contact, either through physical proximity or in the 
virtual world m. It is therefore not surprising that in 
such networks, connection probabilities should fall with 
increasing physical distance. Examples include phone 
call Ill2l and scientific collaboration networks O. In¬ 
deed, a number of network growth models have been pro¬ 
posed in which link formation between agents depends 
on their distance in some abstract space ClIlllH. The 
agents themselves, however, lack physical dynamics in 
these models. 

In this paper, we propose a model capable of generating 
scale-free networks based on locally interacting dynamic 
agents residing in a metric space. A key feature of our 
model is our emphasis on the role of certain locations in 
space in promoting bond formation, the same way that 
the presence of meeting places such as universities, cafes, 
etc. facilitates the formation of new social links. Specif¬ 
ically, the agents stochastically traverse the space and 
form connections only when they encounter each other at 
designated meeting places. The global characteristics of 
the network are then determined by the spatial distribu¬ 
tion of these rendezvous points (RP). When viewed in re¬ 
verse time, a different interpretation of the model is pos¬ 
sible. Now, rather than being meeting places, the RPs 
are seeds from which independent stochastically moving 
agents are spawned. This branching process is reminis¬ 
cent of genetic evolution models where new genes and 
the proteins they encode (represented as points in some 


parameter space) are born through the duplications and 
subsequent mutations originating from an existing pool of 
genes miEeiiniiii. 

Another feature of our model is that it produces a rela¬ 
tively high global clustering coefficient akin to those ob¬ 
served in some biological networks, including neuron fir¬ 
ing correlations IjSj and protein-protein interactions IflSl . 
There exist models capable of producing arbitrary degree 
distributions or relatively high clustering (H [71 HOl fill , 
but BA-like models generally have low clustering unless 
substantially modified EOl . 

The framework we introduce here is very general. The 
model may be solved for agents moving according to a va¬ 
riety of different stochastic processes. For any such pro¬ 
cess, given any desired degree distribution, we can an¬ 
alytically solve the spatial rendezvous point distribution 
that results in that degree distribution. In this paper, we 
first demonstrate the procedure for a concrete example— 
namely, agents moving according to an isotropic random 
walk—and then discuss the general case. 


II. NETWORK OF INTERACTING RANDOM WALKERS 

Consider a flat 2D space with area V - L^. Place W » 1 
random walkers uniformly at random in this space. For 
simplicity we will work in units where ^ ^ 1- Let </>i(x, t) 
denote the probability density of finding random walker i 
at point X and time t. Thus, without any interaction the 
Fokker-Planck equation is the sourced diffusion, or heat 
equation 

ldt-Sl1)(Pi(x,t)-Ji{x,t) ( 1 ) 

where we require agent i to begin its walk at time t-to 
and position x^ by setting the source term to 

Ji{.x,t) = 6{.t-tQ)5‘^{.x-Xi). (2) 

With this source, the solutions of Q are in fact the re¬ 
tarded Green’s functions (f>i(.x,t) - G(.x,t;xi,to). In the 
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case of 2D diffusion, this is given by 


G(x,tx;y,ty) 


9(tx-ty) \x-y\^ 

-exp- 

47l(tx-ty) 4itx-ty) 


(3) 


Defining the operator J^x,t - and its conjugate 

.SfJj = -dt - we have 


^x,tGi(x, f; y, s) = J^’l^Gdx, t;y,s) 
- 6(t - s)6^(x - y) 


(4) 

(5) 


We assume that bonds are formed between two agents 
only when they meet at designated locations (coffee- 
shops, universities, work place, etc) in space, which we 
call “rendezvous points” or RP’s, characterized by a time- 
dependent spatial distribution Ffx, f). Once two agents 
meet at an RP, there is a small chance A that they form a 
bond. Therefore, to the lowest order in A, the probability 
that agents i and j have become connected by time T>to 
is given by 


Aij(to,T) =A 


dt6LxGi(.x,t;xi,to) 


( 6 ) 


X T(.x,t)G ,ix,t;xj,to) + 0(.X^). 


(7) 


The Aij may be interpreted either as elements of the 
weighted dense adjacency matrix of the network of con¬ 
nections, or as bond probabilities, in which case the ma¬ 
trix A defines an ensemble of unweighted random graphs. 


for the RP distribution required for an arbitrary degree 
distribution as we now proceed to do. 

If r is uniform over the space, each node will be over¬ 
whelmingly connected to those in its close vicinity, and the 
translational symmetry results in a sharply peaked de¬ 
gree distribution. Non-trivial degree distributions there¬ 
fore arise only when this symmetry is broken. Let us now 
focus on rotationally symmetric RP distributions r(r,f). 
With this symmetry the degree distribution P(.k) is an im¬ 
plicit function of r, since k is only a function of r. For P(.k) 
monotonic (possibly with cutoffs near k = 0 and ^max), we 
have 


P[^(r)]|d^(r)| = \dNir)\ 
Plkir)] = 


dN 

dN 

dk 

dk 

dr 

dr 


( 10 ) 

( 11 ) 


where dN(r) is the number of nodes in the annulus 
[r,r-i-dr]. The absolute value is necessary since dk/dr 
may be negative. This simple equation combined with 
@ allows us to explicitly calculate the degree distribu¬ 
tion given r(x, t) or conversely, to solve for r(x, t) given a 
desired degree distribution. As a simple example, with 
fo = 0 and a single rendezvous point activated at a single 
time, r(r, t) = S(.r)5(t - te), equations ([^ and ( (TI) yield 

P[k(r)]^47itee(T-te)k-^ ( 12 ) 


which is a power law distribution Pik) ck k ^ with expo¬ 
nent 7 = 1 . 


III. ANALYTICAL RESULTS 


IV. GENERAL POWER-LAW EXAMPLE 


With this simple linear equation many network charac¬ 
teristics can be computed analytically. In what follows 
we will first prove an important relation between degrees 
and the RP distribution r(x,t). Then we will outline the 
procedure which allows one to 1) derive the degree distri¬ 
bution when T{x,t) is given, and more importantly 2) find 
r(x,t) such that a desired degree distribution such as a 
power-law is obtained. 

The degrees ki are defined aski = TlJ^-yAij. Replacing the 
sum with an integral in the continuum limit and using 
/ d?XjGj{x, t;xj, to) - 1 for diffusion we obtain 

k{xi,to,T)-X f [ dtd?xGi{x,t\Xi,to)X'{x,t) (8) 

J J —OO 

where k{xi,to,T) is the degree, measured at time T, of an 
agent starting at position xi at time to- Applying .ifj, 
on both sides of (|8| thus yields the first important result 

.^l^k{x,t,T)^XB{T-t)nx,t) (9) 


We will now derive the conditions for r(r,t) for which the 
degree distribution becomes a power-law, possibly chang¬ 
ing over time with an overall factor p{T - to) and with an 
upper cutoff 

P(k-,to,T) = p{T-to)k-^, Ae[l,^max]. (13) 


The maximum degree ^max is chosen such that the 
expected number of nodes of degree kmax is one, i.e. 
f’(^max) = 1- Therefore from ( 


:^p(T-to)^^r. 


(14) 


Now, in order to solve for r(r, t), we integrate ( [TTl l to find 
k(r, t, T) and plug it in (|^. We obtain 


r(r,t) = if;^^ 


Ji(r-l)r^+piT-t)^^r 

piT-t) 


1 

1-7 


( 15 ) 


where 9(x) is the Heaviside step function. The signifi¬ 
cance of Eq. i) is in that it relates the node degrees to 

the RP distribution. This allows us for instance to solve ^ The lower cutoff will depend on L and T. 
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r(r,t)t 
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FIG. 1. The spatial degree function k{r,t,T) and RP distribution function T(,r,t) for various exponents in 2 spatial dimensions. The 
models are characterized by ^max = kmaxiT -1) taken here to be linear: ^maxls) tx s. 


For arbitrary y > 1, and 

= ip(r-t)exp 


Tir 


p(.T-t) 


(16) 


for j-l. 

The results for y = 1,2,3 and two different p(t) are given 
in Fig. Using these results, we can simulate the model 
by placing agents and RP’s on a finite area of the 2D 
space with appropriate distributions, and computing the 
Aij. To avoid boundary effects, the characteristic range 
of the random walkers cr = \/4T must be much smaller 
than the system size L. For the continuum approxima¬ 
tion to hold. O' must be much larger than the inter-agent 
distance L/\/N. With proper normalization, Aij may be 
interpreted as the probability that the unweighted edge 
(i,j) exists, and different realizations of the network can 
be constructed accordingly. 


A. Higher moments: assortativity and clustering 

The degree distribution is only one of many measures 
characterizing a graph. It is the distribution of the first 
moment ki - Y.jAij. The simplest among higher or¬ 
der measures of graph connectivity would be the degree- 
degree correlation, also known as degree assortativity, 
which compares the degree of the first neighbor of node 
i,k\,toki itself The average first neighbor degree is 

~ W - IP 

j J 

and is thus related to the square of the adjacency matrix. 
The next higher order measure which is related to A® is 
the global clustering coefficient C which measures the de¬ 
gree to which the graph is clustered IfISl 

^ _ 3 X # triangles 

# connected triplets 


which can be shown to be equal to C = Tr {A^\lY.ij y- 
Clustering may also be measured at the vertex level us¬ 
ing the local clustering Ci lfT5]l defined as the number of 
triangles involving node i divided by the total number of 
such triangles possible given the degree ki 


2 X # triangles containing i 
kiiki - 1 ) 


(18) 


By definition c; < 1. Fig. summarizes the results of 
simulations for scale-free distributions with y = 1,2,3. 
For each case, one realization of the unweighted random 
graph ensemble is generated and the degree distributions 
Plk), first neighbor degree-degree correlation {k^), and 
local clustering c{k) is shown. Interestingly, our model 
has a naturally high global clustering coefficient because 
agents close to the RP’s are all likely to connect and form 
close-knit subgraphs. Fig. [^illustrates how our model 
compares to a particular real world network, namely the 
network of protein-protein interactions in the nematode 
C. elegans (CE PPI) from the integrated dataset of dif¬ 
ferent types of interactions (incorporating WI8, litera¬ 
ture curated. Microarray, Phenotype, Interolog, and Ge¬ 
netic interactions) HSj , as well as a Barabasi-Albert (BA) 
network of similar size as the real data. The CE PPI 
network has a power-law degree distribution with power 
P(,k) ~ k~^. We therefore compare it with a y = 1 from our 
model. The PPI network has an average global cluster¬ 
ing of C = 0.47 versus our model’s C - 0.22. The BA (with 
the same number of nodes as PPI and with m = 25 to pro¬ 
duce similar density) on the other hand, has C - 0.03 and 
deviates significantly from the PPI data. In the first two 
moments, P{k) and {k^), our model matches the CE PPI 
almost perfectly. For clustering, our model exhibits a sim¬ 
ilar trend, but falls short in terms of magnitude. 
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Degree—degree Correlation 


Local Clustering 



FIG. 2. Top left: degree distributions of graphs generated from our model for power-law distributions with 7 = 1,2,3. Lines represent 
k~~f for 7 = 1,2,3 (To separate data points for clarity a constant is multiplied into the y-axis of the 3 datasets). Top middle and top 
right show average degree of first neighbors, <^i> vs degrees, and local clustering c(,k) for the 3 simulations, respectively. The curves 
are the means and the shaded area is one standard deviation above and below mean. Lower left: degree distribution of the C. elegans 
integrated protein interaction network ®. P(A)oc k ^ seems to match it well. Lower middle compares the degree assortativity of 
the C. elegans data (CE PPI) with our simulated 7 = 1 data, which almost perfectly reproduces the CE PPI statistics. The blue curve, 
BA, is a Barabasi-Albert network with m = 25 (to produce comparable density) which exhibits a different behavior from our model 
and most parts of the CE PPL Lower right compares the clustering of the three. Our model does seem to behave similar to the CE 
PPI, but it is no match for its extremely high clustering. Our global clustering for this 7 = 1 is C = 0.22 whereas the CE PPI has 
C = 0.47. Number of nodes and edges are 6.2 x 10^ and 1.8 x 10® in CE PPI and 5.4 x 10® and 2.1 x 10® in 7 = 1. 


V. DISCUSSION 

We showed that networks with fat-tailed degree distribu¬ 
tions and long range connections (scale-free networks are 
known to be “ultra small-world” IIH) can arise from local 
interactions, if the translation symmetry is broken. The 
framework we introduced here uses the familiar tools of 
classical field theory. One of our main results is that given 
any (monotonic) degree distribution, we can analytically 
compute the RP distribution resulting in a network with 
that degree distribution. 

While we demonstrated the derivations in the case of 
power-law distributions, other monotonic distributions 
can also be handled similarly. Furthermore, our model is 
generalizable to agent dynamics other than isotropic ran¬ 
dom walks, so long as the dynamics obeys a linear Fokker- 
Planck equation of the form - J{x,t). Finally, 

the model can be solved in higher spatial dimensions as 
well, with similar results. 

From the point of view of application, some real world 
networks, especially biological networks such as neuron 
firing correlation networks from fMRI measurements (Sj 
and protein interaction networks ® tend to have high 
global clustering coefficients (C > 20%). This is where 
many other scale-free network models such as Barabasi- 
Albert (BA) fall short and there have been many attempts 
to remedy this (Tl HH |20). One attractive feature of our 
model is that it has a naturally high global clustering. It 
also exhibits a degree-degree correlation pattern similar 
to biological data. 

It must be stressed that this model was not originally in¬ 


tended as a model of protein-protein interactions. Nev¬ 
ertheless, it contains important elements that might con¬ 
stitute the ingredients for such a model. Accumulating 
mutations may be conceived of as a random walk inside 
some parameter space. A core set of existing genes can 
be represented by a distribution Lfr, t). Genotypic diver¬ 
sification mechanisms such as gene duplication may then 
correspond to branching processes, a simple example of 
which we presented in our model. These elements to¬ 
gether with the partial empirical success of the model 
point to its potential utility as a starting point for mod¬ 
eling biological networks. 
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